Search CORE

124 research outputs found

Adaptation of the human auditory cortex to changing background noise

Author: Herrero J. L.
Khalighinejad B.
Mehta A. D.
Mesgarani N.
Publication venue: Donald and Barbara Zucker School of Medicine Academic Works
Publication date: 01/01/2019
Field of study

Hofstra Northwell Academic Works (Hofstra Northwell School of Medicine)

Estimating and interpreting nonlinear receptive field of sensory neural responses with deep neural network models

Author: Akbari H.
Herrero J. L.
Keshishian M.
Khalighinejad B.
Mehta A. D.
Mesgarani N.
Publication venue: Donald and Barbara Zucker School of Medicine Academic Works
Publication date: 01/01/2020
Field of study

Hofstra Northwell Academic Works (Hofstra Northwell School of Medicine)

Spiking neural network model of cortical auditory source segregation

Author: D Wang
Lakshmi Krishnan
Michael Campos
N Ding
N Mesgarani
S Shamma
Shihab Shamma
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Feature extraction based on bio-inspired model for robust emotion recognition

Author: A Batliner
A Kapoor
AI Iliev
B Schuller
B Yang
C Clavel
C Martínez
C Martínez
D Giakoumis
D Morrison
Diego H. Milone
EM Albornoz
Enrique M. Albornoz
G Chanel
Hugo L. Rufiner
I Luengo Gil
J Adell Mercado
J Kim
J Kim
JR Deller Jr
K Schindler
KP Truong
M Ayadi El
M Wöllmer
N Cummins
N Mesgarani
S Koolagudi
S Shojaeilangari
S Yildirim
SA Shamma
T Chi
X Yang
Y Wang
Z Zeng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2016
Field of study

Emotional state identification is an important issue to achieve more natural speech interactive systems. Ideally, these systems should also be able to work in real environments in which generally exist some kind of noise. Several bio-inspired representations have been applied to artificial systems for speech processing under noise conditions. In this work, an auditory signal representation is used to obtain a novel bio-inspired set of features for emotional speech signals. These characteristics, together with other spectral and prosodic features, are used for emotion recognition under noise conditions. Neural models were trained as classifiers and results were compared to the well-known mel-frequency cepstral coefficients. Results show that using the proposed representations, it is possible to significantly improve the robustness of an emotion recognition system. The results were also validated in a speaker independent scheme and with two emotional speech corpora.Fil: Albornoz, Enrique Marcelo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Santa Fe. Instituto de Investigación en Señales, Sistemas e Inteligencia Computacional. Universidad Nacional del Litoral. Facultad de Ingeniería y Ciencias Hídricas. Instituto de Investigación en Señales, Sistemas e Inteligencia Computacional; ArgentinaFil: Milone, Diego Humberto. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Santa Fe. Instituto de Investigación en Señales, Sistemas e Inteligencia Computacional. Universidad Nacional del Litoral. Facultad de Ingeniería y Ciencias Hídricas. Instituto de Investigación en Señales, Sistemas e Inteligencia Computacional; ArgentinaFil: Rufiner, Hugo Leonardo. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Conicet - Santa Fe. Instituto de Investigación en Señales, Sistemas e Inteligencia Computacional. Universidad Nacional del Litoral. Facultad de Ingeniería y Ciencias Hídricas. Instituto de Investigación en Señales, Sistemas e Inteligencia Computacional; Argentin

Crossref

CONICET Digital

Reconstructing Speech from Human Auditory Cortex

Direct brain recordings from neurosurgical patients listening to speech reveal that the acoustic speech signals can be reconstructed from neural activity in auditory cortex

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

FigShare

Non-hexagonal neural dynamics in vowel space

Author: Bruneau N Roux S, Guerin P, et al.
Bullmore ET Suckling J, Overmeyer S, et al.
Chaumon M Bishop DVM, Busch NA
Constaninescu AO O'Reilly JX, Behrens TEJ
Delorme A Makeig S
Doeller CF Barry C, Burgess N
Fox RA
Fyhn M Hafting T, Treves A, et al.
Fyhn M Hafting T, Treves A, et al.
Gay T
Goslin J Galluzzi C, Romani C
Jezek K Henriksen EJ, Treves A, et al.
Khalighinejad B da Silva GC, Mesgarani N
Kropff E Treves A
Maidenbaum S Miller J, Stein JM, et al.
Manca AD Di Russo F, Sigona F, et al.
Mesgarani N Cheung C, Johnson K, et al.
Miller GA Nicely PE
Mäkelä AM Alku P, May PJ, et al.
Näätänen R Picton T
Picton TW Hillyard SA, Krausz HI, et al.
Scharinger M Idsardi WJ, Poe S
Schwartz JL Boë LJ, Vallée N, et al.
Skipper JI Devlin JT, Lametti DR
Staudigl T Leszczynski M, Jacobs J, et al.
Zwicker E
Publication venue: 'American Institute of Mathematical Sciences (AIMS)'
Publication date: 01/01/2020
Field of study

Are the grid cells discovered in rodents relevant to human cognition? Following up on two seminal studies by others, we aimed to check whether an approximate 6-fold, grid-like symmetry shows up in the cortical activity of humans who "navigate" between vowels, given that vowel space can be approximated with a continuous trapezoidal 2D manifold, spanned by the first and second formant frequencies. We created 30 vowel trajectories in the assumedly flat central portion of the trapezoid. Each of these trajectories had a duration of 240 milliseconds, with a steady start and end point on the perimeter of a "wheel". We hypothesized that if the neural representation of this "box" is similar to that of rodent grid units, there should be an at least partial hexagonal (6-fold) symmetry in the EEG response of participants who navigate it. We have not found any dominant n-fold symmetry, however, but instead, using PCAs, we find indications that the vowel representation may reflect phonetic features, as positioned on the vowel manifold. The suggestion, therefore, is that vowels are encoded in relation to their salient sensory-perceptual variables, and are not assigned to arbitrary gridlike abstract maps. Finally, we explored the relationship between the first PCA eigenvector and putative vowel attractors for native Italian speakers, who served as the subjects in our study

Crossref

Sissa Digital Library

NeuroGrid: recording action potentials from the surface of the brain.

Recording from neural networks at the resolution of action potentials is critical for understanding how information is processed in the brain. Here, we address this challenge by developing an organic material-based, ultraconformable, biocompatible and scalable neural interface array (the 'NeuroGrid') that can record both local field potentials(LFPs) and action potentials from superficial cortical neurons without penetrating the brain surface. Spikes with features of interneurons and pyramidal cells were simultaneously acquired by multiple neighboring electrodes of the NeuroGrid, allowing for the isolation of putative single neurons in rats. Spiking activity demonstrated consistent phase modulation by ongoing brain oscillations and was stable in recordings exceeding 1 week's duration. We also recorded LFP-modulated spiking activity intraoperatively in patients undergoing epilepsy surgery. The NeuroGrid constitutes an effective method for large-scale, stable recording of neuronal spikes in concert with local population synaptic activity, enhancing comprehension of neural processes across spatiotemporal scales and potentially facilitating diagnosis and therapy for brain disorders

Crossref

PubMed Central

Apollo (Cambridge)

CUED - Cambridge University Engineering Department

Neural Segregation of Concurrent Speech: Effects of Background Noise and Reverberation on Auditory Scene Analysis in the Ventral Cochlear Nucleus

Author: AK Nabelek
AR Palmer
B Delgutte
BJ May
E Larsen
JF Culling
JF Culling
JPL Brokx
KL Payton
M Sayles
M Sayles
MC Slama
MK Qin
N Mesgarani
PF Assmann
Philip X. Joris
SE Keilson
SF Poissant
WC Sabine
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A Generalized Linear Model for Estimating Spectrotemporal Receptive Fields from Responses to Natural Sounds

Author: A Ramirez
Ana Calabrese
C Machens
C Scharff
CM Carvalho
D Brillinger
D Smyth
D Snyder
David M. Schneider
DJ Klein
DL Donoho
DL Ringach
DM Schneider
E Covey
E Simoncelli
EJ Chichilnisky
FE Theunissen
FE Theunissen
GB Christianson
J Cynx
J Lewi
JD Zevin
JH Friedman
Joseph W. Schumacher
JW Pillow
JW Pillow
L Paninski
L Paninski
L Paninski
L Paninski
Liam Paninski
M Kouh
M Mesgarani
M Sahani
M Schmidt
M Zhao
M. Fabiana Kubke
MA Escabi
MB Ahrens
MB Ahrens
ML Dent
NC Singh
P Gill
R Tibshirani
RF Lyon
S Eldawlatly
Sarah M. N. Woolley
SMN Woolley
SMN Woolley
SMN Woolley
SV David
SV David
SV David
T Park
T Zhang
TO Sharpee
TO Sharpee
W Truccolo
Y Ahmadian
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

In the auditory system, the stimulus-response properties of single neurons are often described in terms of the spectrotemporal receptive field (STRF), a linear kernel relating the spectrogram of the sound stimulus to the instantaneous firing rate of the neuron. Several algorithms have been used to estimate STRFs from responses to natural stimuli; these algorithms differ in their functional models, cost functions, and regularization methods. Here, we characterize the stimulus-response function of auditory neurons using a generalized linear model (GLM). In this model, each cell's input is described by: 1) a stimulus filter (STRF); and 2) a post-spike filter, which captures dependencies on the neuron's spiking history. The output of the model is given by a series of spike trains rather than instantaneous firing rate, allowing the prediction of spike train responses to novel stimuli. We fit the model by maximum penalized likelihood to the spiking activity of zebra finch auditory midbrain neurons in response to conspecific vocalizations (songs) and modulation limited (ml) noise. We compare this model to normalized reverse correlation (NRC), the traditional method for STRF estimation, in terms of predictive power and the basic tuning properties of the estimated STRFs. We find that a GLM with a sparse prior predicts novel responses to both stimulus classes significantly better than NRC. Importantly, we find that STRFs from the two models derived from the same responses can differ substantially and that GLM STRFs are more consistent between stimulus classes than NRC STRFs. These results suggest that a GLM with a sparse prior provides a more accurate characterization of spectrotemporal tuning than does the NRC method when responses to complex sounds are studied in these neurons

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central